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Maxwell's equations are considered in metric-free form, with a local but otherwise arbitrary consti- 
tutive law. After splitting Maxwell's equations into evolution equations and constraints, we derive 
the characteristic equation and we discuss its properties in detail. We present several results that 
are relevant for the question of whether the evolution equations are hyperbolic, strongly hyperbolic 
or symmmetric hyperbolic. In particular, we give a convenient characterisation of all constitutive 
laws for which the evolution equations are symmetric hyperbolic. The latter property is sufficient, 
' but not necessary, for well-posedness of the initial-value problem. By way of example, we illustrate 

^ our results with the constitutive laws of biisotropic media and of Born-Infeld theory. 
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I . The fact that Maxwell's equations can be formulated on a bare manifold that need not carry a metric or a connection 
^jrj' was first observed by Kottler [l[ and Cartan 0), and later also by van Dantzig [1] and Schroedinger Q, p. 24. It plays 
a central part in Post's [H] systematic study of the formal structure of electromagnetism. More recently, Hehl and 
Obukhov [6[ have studied the metric-free (or pre-metric) approach to Maxwell's equations and its physical implications 
in great detail. It is the philosophy of Hehl and Obukhov to consider electromagnetism as more fundamental than 
gravity. In their approach, Maxwell's equations are formulated on a bare manifold. The constitutive law which 
' connects the electromagnetic field strength with the electromagnetic excitation plays the role of a "space-time relation" . 
In other words, the space-time geometry, which governs gravity, is coded in the constitutive law. By formulating a 
particular, very special, constitutive law one recovers a Lorentzian metric (at least up to a conformal factor) and, thus, 
the ordinary general-relativistic theory of gravity. It is then very natural to speculate that more general constitutive 
laws, which lead to more general geometric structures, could be considered as more general (hypothetical) theories of 
gravity. In particular, such generalised theories of gravity typically predict birefringence for light rays in vacuo (i.e., 
under the influence of gravity alone.) 

The Hehl-Obukhov approach has some similarities, although more in philosophy than in mathematical technicality, 
with an idea of Newman and his collaborators (see, e.g. Q) who suggest to view the equation of wave fronts as 
fundamental for gravity. A characteristic feature (and, maybe, a drawback) of both approaches is that the Lorentzian 
metric that is supposed to describe gravity can be fixed only up to a conformal factor, i.e., up to a strictly positive 
but otherwise undetermined scalar function. 

The metric-free approach to Maxwell's equations gives a strong motivation for investigating which constitutive 
laws are physically reasonable and which ones are not. Here we want to discuss a criterion which is not mentioned 
in the book by Hehl and Obukhov: We want to characterise constitutive laws that yield a well-posed initial-value 
problem. A closely related property is the admittance of wavelike solutions, in a sense that is made precise in Section 
mil below. Our results apply to the case that the constitutive law is interpreted as a spacetime relation (i.e., as the 
vacuum constitutive law in a generalised spacetime theory), but also to constitutive laws in a medium on a standard 
general-relativistic spacetime. 

In contrast to Hehl and Obukhov, who restrict to local and linear constitutive laws throughout, we allow for 
nonlinear constitutive laws. However, we have to maintain the restriction to local constitutive laws which excludes, 
e.g., media with memory such as ferromagnets. The latter case would lead to integro-differential equations, for which 
an initial- value problem in the standard sense cannot be formulated, whereas a local constitutive law leads to first-order 
differential equations for appropriately chosen field components. In Section [H] we derive these differential equations 
and we decompose them into evolution equations and constraints. In Sections IIIII and IIVI we derive and discuss the 
characteristic equation. The real roots of the characteristic equation determine the directions into which wavelike 
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solutions can travel and, thereby, the "light cones" of the theory. Non-real roots are associated with "evanescent 
modes", i.e., with exponentially decaying solutions. In Section |V] we discuss the notions of hyperbolicity, strong 
hyperbolicity and symmetric hyperbolicity. Hyperbolicity requires that all roots of the characteristic equation are 
real, i.e., that evanescent modes do not occur. Strong hyperbolicity is a necessary and sufficient condition for the 
initial-value problem to be well-posed. Symmetric hyperbolicity is a sufficient but not a necessary condition for 
the initial-value problem to be well-posed. Having established these notions, we investigate some properties of the 
light cones for the case of hyperbolicity in Section |VT1 In Section IVIII we prove that the light cones are coordinate 
invariant which is not obvious from our derivation. The following three sections present some results that are useful 
for calculations: In Section IVlIII we discuss how the roots of the characteristic equation can actually be determined; 
m Section |IX] we demonstrate the invariance of the characteristic equation under certain changes of the constitutive 
law; and in Section |X] we derive an alternative form of the characteristic equation. The case that the light cones are 
invariant under temporal or spatial inversion is considered in Section IXIl and the case that there is no birefringence 
is considered in Section IXIII In Section IXIIII we characterise the class of all constitutive laws for which the evolution 
equations are symmetric hyperbolic. Finally, two examples are worked out in Section IXIVI biisotropic media and 
Born-Infeld electrodynamics. In the conclusions we summarise the results that have been achieved so far, and we list 
some important questions that are still open. 



II. MAXWELL'S EQUATIONS IN METRIC-FREE FORM 

We consider a 4-dimenional bare manifold, with coordinates x — {x'^ ,x'^ ,x^). We use Einstein's summation 
convention for latin indices running from to 3 and for greek indices running from 1 to 3. We refer to x'^ as to the 
time coordinate and to x^ as to the spatial coordinates. At present, this is just a convenient mode of expression. As 
we have no structure on our manifold, it does not make sense to ask whether the a;"— lines are timelike or whether 
the hypersurfaces = constant are spacelike. Later, however, we will discuss the question of whether initial values 
for the evolution part of Maxwell's equations on the hypersurfaces x'^ ~ constant determine a unique solution on an 
appropriate neighborhood. If this is true, one might view the covector dx'^ as "timelike" , in a sense determined by 
the evolution equations and not by a background structure. The covectors which are timelike in this sense turn out 
to form an open convex cone at each point of the manifold, see Section [VII below. 
In standard index notation, Maxwell's equations read 

d[aFbc]{x) = Mabc{x) , d[aHhc]{x) = Jabc{x) , (1) 

where the square bracket denotes antisynimetrization. Here Fab = ^^6a is the electromagnetic field strength, Hab — 
—Hba is the electromagnetic excitation, Jabc is the electric current and Mabc is a hypothetical magnetic current. (On 
physical grounds, there is good reason to assume that the latter is zero; however, we take it into account for the sake 
of generality.) Under coordinate transformations, these fields change according to 

- dx'^ dx'^ ~ dx"^ dx'^ dx-f 



fr _ det(fl) dx^^dx^ ~ det(fl) dx'^ dx^ dxf 

- |det(f|)| dx- dx" "^"^ ~ |det(f|)| dx- dx" dx- ' ^ ' 

Thus, in the terminology of de Rham Q, Fab and Mabc are even differential forms whereas Hab and Jabc are odd 
differential forms. (An even differential form is the same as a totally antisymmetric covariant tensor field whereas an 
odd differential form is the same as a totally antisymmetric covariant pseudotensor field.) 

We assume that the electric current Jabc and the magnetic current Mabc are given by equations of the form 

Jabc{x) = jabc{x,F{x),H{x)) , M abc{x) = m abc{x , F {x) , H {x)) . (4) 

Here it is essential that the values of the currents at x depend on x and on the values of field strength and excitation 
at X, but not on their derivatives. We will soon see that, under this assumption, the currents are irrelevant for the 
question of whether the initial value problem is well-posed. Therefore, they will play no role in our further discussion; 
we have allowed for non-zero currents only for the sake of generality. Note, however, that the functions jabc and mabc 
in ((4]) are not completely arbitrary. They must be consistent with the conservation laws 

d[aJbcd\{x) = 0, d[aMi,cd]{x) =0, (5) 
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which are a consequence of ([T]). 

In addition to @ we assume that we have a constitutive law in the form of six scalar equations 

TA{x,F{x),Hix)) =0 , A^l,...,6 (6) 

which allow to express six of the twelve independent components Fab{x) and Hab{x) in terms of the remaining six. 
(We shall later specify the six components which are to be eliminated.) Again, it is essential that the constitutive 
law is local in the sense that knowledge of six components at a particular point x allows to express the remaining six 
components at this particular point x. In particular, it is essential that ([5]) does not involve derivatives of the field 
components. 

We now separate the 8 equations ([T]) into two constraints 

d[p.F^c]{x) ^ J,,va{x) , d[f,Hi,„]{x) = AIf,^„{x) , (7) 
which do not contain any do derivative, and six evolution equations 

dloF,ya]{x) = Joua{x) , 5[o-ff,ya] (x) = M(iua{x) , (8) 

which do contain 9o derivatives. For the well-posedness of the initial-value problem, only the evolution equations are 
relevant. The constraints restrict the allowed initial values. After solving the evolution equations with initial values 
that satisfy the constraints, one has to check whether the constraints are preserved. This is guaranteed if the currents 
satisfy the conservation laws ([5]). 
To link up with standard notation of electrodynamics, we decompose field strength and excitation in electric and 
magnetic parts. 

FuO — En , F^p £f_ipaB , 

F^uo T~Li, , F^^p ^^paF) , 

where e^pa- is the 3-dimensional Levi-Civita symbol, defined by the properties that it is totally antisymmetric and 
satisfies £123 = 1. 

This puts the constraints ([7]) into the form 

dpDP + . . . = , dpBP + . . . = , (10) 

where the ellipses indicate terms that do not involve derivatives of the fields. The evolution equations © can be 
conveniently written in six-vector form. To that end, we write E and H for the three-column vectors with compo- 
nents Ei,E2,E3 and 7^1,7^2,^3, respectively, and we write D and B for the three-column vectors with components 
D^,D^,D^ and B^,B^,B^, respectively. Then the evolution equations ([8]) read 

*(l)-(AVf)m)....^0. 

Here the 3x3 matrices A'' are defined by 



A^ = 1 , A^ = , A^ = -1 (12) 






and, as in ()10|) . the ellipses in ([Tl]) indicate terms that do not involve derivatives of the fields. 

We shall now require that the constitutive equations ([6]) can be solved for E{x) and H{x). If this is the case, 
we can eliminate E and H from (jll[) with the help of the constitutive equations, and we are left with a set of six 
first-order differential equations for the six dynamical variables B and D. If, on the other hand, the constitutive 
equations cannot be solved for E{x) and 'H{x), the number of evolution equations does not coincide with the number 
of independent dynamical variables, so there is no chance to get a well-posed initial-value problem. We introduce the 
following terminology. 

Definition 1. A coordinate system is called admissible if, in this coordinate system, the constitutive law ([5]) can be 
solved for E{x) and 'H{x). A constitutive law is called regular at x if there is an admissible coordinate system on 
some neighborhood of x. 



4 



Henceforth we assume that we have a constitutive law that is regular at some chosen point, and we work in an 
admissible coordinate system defined on some open neighbourhood U of this point in M . We will see in Section IVIII 
that then, if U is chosen sufficiently small, almost all other coordinate systems on U are admissible as well, and we 
will investigate the behaviour under coordinate changes of all relevant quantities. 

The assumption that (O can be solved for E{x) and T-L{x) results in equations of the form 

dpE^{x) = K^r{x,D{x),B{x))dpD^{x) + xA^,D{x),B{x))dpB^{x) + ... (13) 



dpn^,{x) - -ipr{x,D{x),B{x))dpD^{x) + v^r{x,D{x),B{x))dpB^{x) + ... (14) 

Here, as before, the ellipses indicate terms that do not involve derivatives of the fields. In the more particular case 
that the constitutive law is linear, the coefficients k^,-, Xiit, Iilt and Vp^r depend only on x but not on the fields. In 
the following we denote by k, v, x and 7 the 3x3 matrices with components Upr, ^'^t, X/jt and 7^7-, respectively, k 
is called the impermittivity matrix, u is called the impermeability matrix, and x and 7 are called the magneto- electric 
cross-terms. (Our notation follows Kong 0, [iflj.) The standard text-book formalism of electrodynamics is recovered 
if we assume that x and 7 vanish and that k and i> depend only on x and are invertible. Then e = is called 
the permittivity (or dielectricity) matrix and /x = is called the permeability matrix. A priori, however, there is 
no reason to assume that we can choose our coordinate system such that the magneto-electric cross-terms vanish and 
that K and v are invertible. For a detailed discussion of media with magneto-electric cross-terms see O'Dell [TTj . 
The four 3x3 matrices k, v, x and 7 can be combined into the 6x6 matrix 

M = (; (15) 

which we call the constitutive matrix. 

With and (ITU) inserted into (fTTj) . we get the following set of six evolution equations for the six dynamical 
variables D and B. 

do (I) -L^ap (^1) + ... = 0, (16) 

where the 6x6 matrix 

^ (a. -f) (; J) (") 

depends on x and, in the case of a non-linear constitutive law, also on D{x) and B[x). Thus, (|16p is a quasilinear 
system of partial differential equations with non-constant coefficients. 

Before we proceed further it is useful to add a remark on the fact that we had to solve the constitutive equations 
for E and rather than for any other combination of field components, as a necessary condition for having a 
well-posed initial-value problem. It is sometimes argued (see, e.g., O'Dell [TTI|, Section 2.1, or Hehl and Obukhov 
that one should solve the constitutive equations either for E and B, or for D and T-L, because only then has 
the resulting equation a covariant (i.e., four-dimensional, coordinate-independent) meaning. According to this point 
of view, constitutive equations solved for other combinations of the field components are "a historical artifact" and 
"should be phased out from use" jl2j. It is, indeed, true that the condition of solvability for E and H is not covariant. 
Nonetheless, it is precisely this condition which appears if we ask for a well-posed initial-value problem. This should 
not come as a surprise. The initial-value problem refers to a particular slicing of the spacetime into hypersurfaces 
= constant. It is not a problem that has a covariant answer; the initial-value problem is well-posed for some 
slicings, and not well-posed for others. So it is quite natural that non-covariant conditions play a role. 

If the constitutive law can be solved not only for E and % but also for E and B, the impermittivity matrix k must 
be invertible. The constitutive matrix (jl5p can then be written in the form 



\ ')(- ' '^"^1 (18) 

S**" l/\0 V --fK-^x) \ ^ 



as can be easily verified by multiplying out the right-hand side, implies 

dct(M) = det(7y -7/«"^x)det(/«) . (19) 
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Analogously, if u is invertible, we find 



1 y V v) xv-^-i 1 



M = c )(.':.)( ..-u. : ) (20) 



and hence 

det(M) = dct(K - xt^^^l) det(iv) . (21) 
Equations (|18p and pO| arc useful for calculating the inverse of M . 



III. APPROXIMATE-PLANE-WAVE SOLUTIONS OF MAXWELL'S EQUATIONS 

Now we want to derive the characteristic equation of the evolution equations (|16p and the resulting light cone structure. 
There are two quite different methods of how to do this. The first method, which goes back to Hadamard, investigates 
the directions in which discontinuities of the electromagnetic field can propagate. For linear constitutive laws on a 
bare manifold, this method is used in the book by Hehl and Obukhov 6]. The second method investigates the 
directions in which approximate-plane-wave solutions of Maxwell's equations can travel. This method was pioneered 
by Luneburg whose work is reviewed in the book by Kline and Kay 13]; their treatment is restricted to linear and 
isotropic constitutive laws on Minkowski spacetime. 

Here we want to use the second method because it provides us with a clear physical interpretation of the charac- 
teristic equation. As there are no treatments in the literature that cover our situation - Maxwell's equations with a 
local but possibly nonlinear constitutive law on a bare manifold -, we give a detailed and self-contained exposition. 
Our first task is to define the notion of an "approximate-plane- wave solution" . 

In standard electrodynamics on Minkowski spacetime, wave propagation can be studied in terms of plane harmonic 
waves. Maxwell's equations on a bare manifold do not admit plane-harmonic-wave solutions in general. However, they 
do admit such solutions in an approximative sense. To make this mathematically precise, we introduce the following 
terminology. 

Definition 2. An approximate-plane-wave family with background field D{x) , B{x) is a one-parameter family 

I = ( + aRcH ( + 0(a) ) exp ^5(a;)/a) } (22) 

V»(a,x); \B{x)J [^\bix)J > J 

with the following properties: 

(a) The coordinates x — (x", x^ ,x^ ^ x'^) range over an open neighborhood U of the manifold M and the parameter 
a ranges over the strictly positive real numbers, a G M^. 

(b) _D and B are valued C°° functions. 

(c) 5 is a real-valued C°° function whose gradient dS{x) = daS{x) dx°' has no zeros on U. We refer to S as to the 
eikonal function of the approximate-plane- wave family. 

(d) d and h are valued C°° functions with h{x)^ ^ (Oj O) fo'" ^ U^ 

If daS, d and h are independent of x (in the chosen coordinate system) and the 0{a) terms in are zero, is 
a background field with an a-dependent plane harmonic wave added; the wave covector of this plane harmonic wave 
is given by ka = daS/a. This observation gives the following interpretation to an arbitrary approximate-plane- wave 
family. On a sufficiently small neighborhood, daS, d and b differ arbitrarily little from constants, and for a sufficiently 
small the 0{a) terms give arbitrarily small contributions. Thus, on a small neighborhood and for small a, ([22]) can be 
approximately viewed as a plane harmonic wave added to the background field. The smaller a, the more oscillations 
we have on the chosen neighborhood. We refer to a as to the high-frequency limit. 

For the evaluation of approximate-plane-wave families the following simple lemma is crucial. 

Lemma 3. Let S be the eikonal function of an approximate-plane-wave family and let u be a complex-valued 
continuous function defined on the same neighborhood U as S. Then lim Re|w(a;)e*'^'-^^/"| exists for all x in U if 

and only if u{x) = for all x in U. 
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Proof. If u is different from zero at some point in [/, it is different from zero on an open subset V of U. For almost 
all cc in 1^ we have S{x) ^ 0, as dS has no zeros. As a consequence, for almost all xinV the limit does not exist. □ 

We now consider an approximate-plane- wave family, given in coordinates x = {x'^,x^,x'^,x^) that are admissible 
in the sense of Definition [1] We assume that the background fields D{x) and B{x) satisfy the constraints (ITUl) and 
the evolution equations (ITCl) . We then say that the approximate-plane- wave family is an N^^ order asymptotic 
solution to the constraints (1101) if 




and to the evolution equations (|16l) if 

{ ^ (* £(!:!)) - ^'(-^<-)'^<-))«» - it) ^ <^^> 

Here the ellipses stand for the same terms as in PU)) and respectively. It is obvious that an N^^ order asymptotic 
solution is automatically an M^^ order asymptotic solution for all AI < N. We want to investigate the lowest non- 
trivial order = 0, which is known as the geometric optics approximation. (We will not consider asymptotic solutions 
of higher order, which allow to determine the 0(a) terms in ([22]) iteratively.) Then in ([23| and (|24l) the terms indicated 
by ellipses, which do not contain derivatives of the fields, give no contribution to the limit. Using Lemma [31 we find 
that an approximate-plane- wave family (j22[) is an asymptotic solution of order A'^ = to the constraints if and only if 

dpS{x)dP{x) = 0, dpS(x)bP(x) = 0, (25) 

and to the evolution equations if and only if 

dpSix)LP{x,Dix),B{x)) (^f^^^ = doSix) (^f^^^^ . (26) 

As the amplitudes are assumed to be non-zero, ([^5]) means that doS{x) must be an eigenvalue of the 6x6 matrix 
dpS{x)-LP{x,D{x),B{x)), i.e. 

det (^doS{x)l - dpS{x)LP{x,D{x),B{x))^ = 0. (27) 

(|27p is the eikonal equation of the evolution equations. It is a first order partial differential equation for S. Its 
coefficients depend not only on x but also on the background fields D{x) and B{x), unless we restrict to linear 
constitutive laws. 



IV. THE CHARACTERISTIC EQUATION 

Now we choose values for x, D{x) and B{x) and keep them fixed. We want to investigate the condition that there 
is a nonzero (d, 6) such that (^5)) and (^5]) hold. This is an algebraic condition on the covector dS = daSdx°', 
i.e., it determines a subset of the cotangent space at x. We will call any covector which satisfies this condition a 
"characteristic covector" . Hence, a covector is characteristic if and only if it is the gradient of an eikonal function of 
an approximate-plane- wave solution of order A^ = 0. The precise definition reads as follows. (As x, D{x), B{x) are 
now kept fixed, dependence on these quantities is no longer made explicit to ease notation.) 

Definition 4. A covector Padx'^ is called characteristic if there is ( 5, 6) 7^ ( 0, O) in such that 

PpdP = Q, PpbP = 0, (28) 



bj " 

The set of all characteristic covectors is called the characteristic variety. 



PpLP [i] = Po ((] ■ (29) 



7 



We will show in Section fVlII that this definition is coordinate-independent. Clearly, a necessary condition for Padx"" 
to be characteristic is that it satisfies 

det(pol - PpL") = 0. (30) 

(|30p is called the characteristic equation and its left-hand side is called the characteristic polynomial of the evolution 
equations. Note that (|30|) makes sure that (|29p admits a non-trivial solution but does not take ([28l) into account. 

By writing covectors as Padx'^ we have introduced canonical momentum coordinates Pa in each cotangent space 
which are conjugate to our admissible coordinates x"" . If can be interpreted as a temporal coordinate and the x^ 
as spatial coordinates, we refer to po as to the frequency and to p as to the spatial wave covector (for dimensional 
reasons, one may put in a factor h); here and in the following, p stands for the three-column vector with components 

Pl,P2,P3- 

The left-hand side of (pO)) is a sixth order homogeneous polynomial in the variables Pa- For each p e M^, it is a 
sixth order polynomial in the variable pq. As such, it has six complex roots which will be denoted by wa(p) where 
A = 1, ... ,6. In this notation, the characteristic equation (15(11) takes the form 



n {po-uja{p}) - 0. (31) 



A=l 



The iOAip) the eigenvalues of the matrix PpL'' , 



In general they are complex, and so are the components of the eigenvectors dA{p) and hA{p). Only real uja{p) 
are related to approximate-plane-wave solutions, because the eikonal function is supposed to be a real function. 
(Non-real eigenvalues are associated not with oscillating modes but rather with so-called evanescent modes, i.e., with 
exponentially decaying fields.) From the real solutions uja{p) of the eigenvalue problem we have to single out 
those for which the eigenvectors satisfy the constraints this will give us all the characteristic covectors in the 

form ujAipjdx'^ +p^dx^. 

If p runs over K'^ \ {0}, PpA.^ runs over all non-zero antisymmetric 3x3 matrices. Any such matrix has a one- 
dimensional kernel. As we can read from P7)) . this implies that the kernel of the matrix PpL^ must be at least 
two-dimensional, hence po = is an eigenvalue of algebraic multiplicity > 2 and the characteristic equation is of the 
form 

4 

pI H {po-iOA{p)) = 0. (33) 

The eigenvalues uJAip), for A — 1, ... ,4, are in general complex and some of them may be zero as the algebraic 
multiplicity of the eigenvalue po — may be bigger than 2. If M is invertible, the kernel of the matrix PpL'' is 
precisely two-dimensional, hence po = is an eigenvalue of geometric multiplicity equal to 2; however, even in this 
case the algebraic multiplicity may be bigger than 2. 

We will now show that, if wa(p) 0, the corresponding eigenvectors automatically satisfy the constraints (|28p. 

Proposition 1. If p2)) holds with uja{p) 0, the dA{p) and bA{p) satisfy d''^{p)pp ~ b'\{p)pp = 0. 
Proof. The eigenvalue equation p2|) implies that, for all complex numbers a and h, 



where the dot denotes the standard scalar product in C^. Using the antisymmetry oi PpA^ , this can be rewritten as 



As p spans the kernel oippA^ , the left-hand side vanishes, so the right-hand side has to vanish for all complex numbers 
a and h. As liJa{p) is non-zero, this completes the proof. □ 
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One may interpret the constraints (j28l) as saying that d and b must be transverse. By Proposition [U this transver- 
sahty condition follows already from the evolution equations for modes with non-zero frequency. It is interesting to 
note that such a transversality condition does not hold for e and ft,; on a manifold without a metric, trans versality 
cannot even be formulated for these fields because one would have to raise an index. This is another indication that 
the choice of D and B as the dynamical variables is the natural one, cf. Kong and T^, Section 3.3, where an 
analogue of Proposition [1] is discussed for the special case of homogeneous linear media on Minkowski spacetime. 

If the algebraic multiplicity of zero, as an eigenvalue of PpL^, is equal to 2, Proposition [1] guarantees that a real 
covector Padx"' is a characteristic covector if and only if it satisfies the reduced characteristic equation 

II {po - loa{p)) = — det(pol - PpL") = 0. (36) 

A=l Po 

Whereas (pO| is the "characteristic equation of the evolution equations" , (p6|) can then be properly called the "char- 
acteristic equation of the full Maxwell equations" (evolution equations plus constraints) . 

The situation is very much more inconvenient if, for some p £ \ {0}, the algebraic multiplicity of zero, as an 
eigenvalue of PpL'', is > 2. In this case one of the four roots tOAip) of the reduced characteristic equation is zero. 
The corresponding covector u!a{p)dx^ + p^dx^ may or may not be characteristic, i.e., the eigenvectors may or may 
not satisfy the constraints. If they do, we have an approximate-plane-wave solution of Maxwell's equations of order 

= with zero frequency. One would conclude from this observation that cannot be interpreted as a temporal 
coordinate. Therefore it seems reasonable to discard such cases as unphysical, i.e., to restrict to cases where the 
coordinates can be chosen such that zero-frequency modes do not occur. 

If K is invertible, it is convenient to write the reduced characteristic polynomial (j36p in the form 

4 

II {po ~ uja{p)) ^ det{K)g-'-^paPbPcPd (37) 

so that the characteristic equation reads G"^'"^'^PaPbPcPd — 0. The reason for introducing the factor det(/t) is that the 
coefficients g°-^''d. transform like a tensor density, as will be proven in Section fVIII Note, however, that in the case 
of a non- linear constitutive law the g°-^''d. depend not only on x but also on D{x) and B{x). Following Hehl and 
Obukhov ^6j] we call ty'''"^'* the Tamm-Ruhilar tensor density. For an interesting representation of the Tamm-Rubilar 
tensor density, using the adjugate (or classical adjoint) of a matrix, sec Itin [14]. 

Before discussing the (reduced) characteristic equation in more detail, we add a word on terminology. We have 
used the term "characteristic equation" which is the standard notation in texts on partial differential equation. In 
physics texts one finds the alternative term dispersion relation. Some authors, e.g. Hehl and Obukhov [6|, use the 
term Fresnel equation as another alternative. Traditionally, the term "Fresnel equation" is used in crystal optics for 
an equation that determines the index of refraction (or, equivalently, the phase velocity) in dependence of the spatial 
direction, see e.g. Born and Wolf [l^. Sect. 15.2.2. This equation is, indeed, equivalent to the characteristic equation. 



V. HYPERBOLICITY OF THE EVOLUTION EQUATIONS 

In the preceding section we have seen that the characteristic equation pop . together with the constraints, determines 
the directions Padx"" in the cotangent space in which wavelike solutions can propagate. We will now discuss that 
the characteristic equation pop also contains all information that is necessary to decide whether or not the evolution 
equations determine a well-posed initial- value problem. 

Recall that we denote by u)a{p)j for A = 1, . . . , 6, the six eigenvalues of the matrix PpL'' which depends on D{x) 
and B(x). Two of these eigenvalues are zero, ws = = 0, the other four are, in general, complex. We now recall 
some standard terminology from the theory of partial differential equations. 

The evolution equations are called hyperbolic if all eigenvalues ijJa{p) are real, for all p in R'^. (This is the case 
if and only if, in the terminology of Carding (161] . the characteristic polynomial det(pol ~ Pp^'') is hyperbolic with 
respect to the covector dx^ .) The evolution equations are called strongly hyperbolic if for each p there is an invertible 
matrix S{p) such that S{p)^^ppJjPS(p) is symmetric. They are called symmetric hyperbolic if this is true with an S 
that is independent of p. Obviously the following implications hold: symmetric hyperbolic ^ strongly hyperbolic 
hyperbolic. 

Of course, the answer to the question of whether any of the three properties - hyperbolic, strongly hyperbolic or 
symmetric hyperbolic - holds, may vary in dependence of x, D(x) and B{x). 
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Hyperbolicity guarantees the unrestricted existence of approximate-plane-wave solutions. However, it is too weak 
to guarantee well-posedness of the initial-vahie problem for the class of differential equations - quasilinear with non- 
constant coefficients - to which our evolution equations belong. 

The latter requires strong hyperbolicity. More precisely, if the evolution equations are strongly hyperbolic at some 
X, D{x) and B{x), the following holds true. Data for D and B on the hypersurface a;° = constant that take the 
prescribed values D{x) and B{x) at x determine a unique solution to the evolution equations on some neighborhood 
of X, and the solution depends on the data continuously. The data must be of Sobolev class H'^, for some s > 3, and 
continuity is meant with respect to the Sobolev i? norm. For details and proofs the reader is referred to Taylor il7i] , 
Theorem 5. 2D. (Note that Taylor uses the term "symmetrizable" instead of "strongly hyperbolic".) 

In the more special case of symmetric hyperbolicity we have not only continuous dependence of the solution on the 
data but in addition we can control the growth of the solution in terms of energy inequalities. Usually symmetric 
hyperbolicity is easier to check than hyperbolicity or strong hyperbolicity. In Section IXIIII below we will give a 
convenient characterisation of all constitutive laws that give symmetric hyperbolic evolution equations. For hyperbolic 
or strongly hyperbolic evolution equations, no such characterisation is known so far. 

Lindell, Sihvola and Suchy [Tsj . in an otherwise very useful article, claim that the eigenvalues uja{p) are real whenever 
the constitutive matrix M is symmetric. This would give a very convenient sufficient condition for hyperbolicity. 
Unfortunately, the claim is wrong; a counter-example is 

M = (; J) (38) 

for which the matrix Pph^ has eigenvectors ujiip) = (^2{p) = ^sip) = W4(p) = —i\p\ and uj5{p) = uJe{p) — 0. The 
error comes in eq. (65) of [31 where the authors divide by a quantity without paying attention to the fact that this 
quantity may be zero. (Note that Lindell, Sihvola and Suchy allow the constitutive matrix to be complex, in contrast 
to the formalism used here, and that their M is our M^^. In the above argument we specified their reasoning to the 
case that M is real and we used the obvious fact that an invertible real matrix is symmetric if and only if its inverse 
is symmetric.) 



VI. THE LIGHT CONES 



The homogeneity of the characteristic polynomial implies that the functions loa are positively homogeneous of degree 
one, 

u}a{sp) = suja{p) for all s > . (39) 

We now assume hyperbolicity, and we order the 4 roots of the reduced characteristic polynomial according to 

^^i{p) > ^2{p) > ^^sip) > ^i{p) ■ (40) 

This guarantees that the uja are continuous, but not necessarily smooth, functions. Thus, the reduced characteristic 
equation defines four connected sets 

Ca = {Padx''\pn=iUA{p)} , A = 1,2,3,4, (41) 

in the (real) cotangent space of the chosen point x. Without our assumption of hyperbolicity one would have to 
consider the Ca as subsets of the complexified cotangent space. 

By p9p , each Ca is a cone in the sense that it is invariant under multiplication with positive real numbers. We refer 
to the Ca as to the four branches of the characteristic variety or, shorter, as to the four light cones. By differentiating 
the characteristic equation we find immediately that the differential of pq ~ uja{p) is non-zero at any point of Ca 
where Ca does not meet one of the other light cones. Thus, Ca is a 3-dimensional manifold at any such point. At an 
intersection point with some other light cone Cb, however, Ca need not be smooth. For instance, Ca and Cb may form 
a "conical singularity" , with pointed tips meeting head-on. This gives rise to the phenomenon of "conical refraction" 
whose observability has been a matter of vivid debate in the history of anisotropic optics; for a detailed discussion 
see e.g. Born and Wolf ISj, p. 813-818. 

Our ordering PO)) implies that C4 is the image of Ci and C3 is the image of C2 under reflection at the origin in the 
cotangent space. This follows immediately from the fact that the characteristic polynomial is homogeneous. Thus, 
Ci U C4 is a double-cone in the sense that it is generated by straight lines through the origin, and so is C2 U C3. Note, 
however, that in general there is no reflection symmetry with respect to the plane po = 0- 
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We will now show that the four light cones divide up into two past cones and two future cones, provided that the 
reduced characteristic polynomial has no zero roots. In the terminology explained in Section [IVI the latter condition 
means that we prohibit zero-frequency modes. 

Proposition 2. Assume that the four roots of the reduced characteristic polynomial are real and non-zero for all 
p 7^ 0, and that we order them according to PO)) . Then 

Wl(p)>W2(p)>0>W3(p)>W4(p) (42) 

holds for all p 7^ 0. 

Proof. Assume that three of the four roots uja are positive at some p. Owing to the homogeneity of the characteristic 
polynomial, three roots must be negative at —p. However, as the loa are continuous on M"^ \ {0}, this is possible 
only if some loa has a zero somewhere on M'^ \ { 0} which contradicts our assumption. We have thus proven that it 
is impossible that three roots are positive. By the same token, it is impossible that three roots are negative, so we 
must have two positive and two negative roots, i.e., (|42p must be true. □ 

From this proposition we find the following Corollary. 

Proposition 3. Assume that the evolution equations are strongly hyperbolic and that the constitutive matrix M is 
invertible. Then (gS]) holds for all 6. 

Proof. What we have to prove is that our assumption prohibits zero- frequency modes, i.e., that pq — has algebraic 
multiplicity equal to 2 as an eigenvalue of ppL'' for all py^O. We have aleady observed in Section HVl that invertiblity 
of M guarantees that the geometric multiplicity of po = is equal to 2. Strong hyperbolicity makes sure that algebraic 
and geometric multiplicity coincide. □ 

Whenever we have hyperbolicity, Ci and C4 are the boundaries of convex open cones 

Zi = {pa dx" \po>LJi{p)}, = {pa dx" \ po < uja{p) } . (43) 

The convexity of Zi and Z^ is a general feature, following from hyperbolicity, as was already proven in Garding's 
pioneering paper [la|. By contrast, C2 and C3 are not in general the boundaries of convex sets. One may call covectors 
in Zi "future-pointing timelike" and covectors in Z^ "past-pointing timelike" . Of course, future and past interchange 
their roles under reflection of the coordinate x^ . The fact that the light cones, as subsets of the cotangent space, are 
independent of the chosen coordinate system will be proven in Section [VIII 



VII. COORDINATE TRANSFORMATIONS 



We have worked in a chosen admissible coordinate system throughout, and we will now investigate to what extent 
the results found are invariant with respect to coordinate transformations. In particular, we will verify that the light 
cones are invariant (i.e., coordinate- independent). 

As the hypersurfaces x'^ = constant play a distinguished role, it is useful to distinguish coordinate transformations 
that leave these hypersurfaces invariant. The most general such transformation induces on each cotangent space a 
linear transformation of the form 

dx^' = aPp{dxP + vPdx°) , dx° = cdx^ (44) 
and on each tangent space the dual linear transformation 

dxP " dxP ' 9i0 c ^ dx^ dxp ' ' ^ ^ 

Here a = (a^p) is an invertible 3x3 matrix, b = {b,_/) is the transpose of its inverse (i.e. ab-^ = b^a = 1 or, in index 
notation, a^p b^P = hpP^aPi, = 5^), (u^, u^, w^) is a real 3-tuple and c is a non-zero real number. We call (|44|) and (j45|) 
a generalised Galilean transformation. It reduces to a standard Galilean transformation if (a^p) is orthogonal and 
|c| = 1. An arbitrary coordinate transformation induces on the cotangent and tangent spaces linear transformations 
that can be written as a generalised Galilean transformation (|44l) and (|45|) followed by a transformation of the form 



dx^ = dx^^ , dSp = dxP + Uadx" , 



(46) 
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d d d d d 

where {ui,U2,U3) is an arbitrary real 3-tuple. 

From the transformation behaviour ([2|) and ([3]) of field strength and excitation we can calculate the transformation 
behaviour of the fields E, B, % and D as defined in Q, and thereupon of the constitutive matrix whose components 
are defined by (fT3| and (fT4|) . With respect to generalised Galilean transformations (j44|) and ((45|l . we find 

= -^K-'{Er+vPepraB") , = (det(a))"'a^S^ 

~ \c\ |det(a)| '"^"^ \c\ |det(a)| ° ' 



which yields, after some elementary algebra, the following transformation rule for the constitutive matrix. 

(49) 



n x\ _ det(a) / h{x + v^A^)h^ 



Here the antisymmetric matrices Ap = A'' are defined by ()12p . By (|49l) . M is well-defined whenever M is. Hence, a 
generalised Galilean transformation transforms admissible coordinate systems into admissible coordinate systems. 
With these results at hand, and with the transformation of the canonical momentum coordinates 

= b/Pp ' -Po = ^{po- v^Pp) , (50) 
we can now calculate the transformation behaviour of the eigenvalues and eigenvectors of PpL''. 

Proposition 4. Under a generalised Galilean transformation and P5)) . the eigenvalues and eigenvectors ((5^ of 
PpT-if transform according to 

i^a{P) = ^ {uja{p) -yPpp) , (51) 




det(a)-f^ (52) 
\0 a/ V bA{p) J 



for A = 1,2,3,4. 



Proof. The transformation behaviour (|49p of the constitutive matrix, together with (|50p . allows us to calculate the 
transformed matrix Ppli'', 

-, / c det(a) , fx A 

aO\-^f, /-aOX 1 fO -A'\ wSl" X + vA, 



Now assume that (I32p is true. This eigenvalue equation can be equivalently rewritten as 

(c dct(a) 
|cdct(a)| 1^ ^ 
7 j^i-J V bMPl I \ bApl 

With u}a(p), d-AiP) and 6a (p) introduced by ([5T|) and ([5^ . we find from ([55]) and ([Si]) , with the help of Propositon[TJ 
that ([5^ holds with all terms twiddled. □ 

From this proposition we read that 

Ppdx'^ + CjA{p)dx'^ — Ppdx'^ + ujAipidx'^ (55) 
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which demonstrates that the Hght cones are invariant. The reduced characteristic polynomial transforms as 

4 4 



Y[ {po - (^AiP)) ^ ^ tiiPa- ^a{p)) ■ (56) 

With 



c 

A=l A=l 



det(K) = det(K) , (57) 



which follows from (jlH]), this implies 



1 4 1 1 

d^ n - -a(p)) = ^ n (po ^AiP)) ■ (58) 

This shows that under generalised Galilean transformations the introduced in ([BT)) . transform as a tensor density 
of weight 1 , 



det(ll) I ^''^^^p.PbPePrf. (59) 
We now turn to transformations of the form and (HT)) . The fields change according to 

(60) 

where e^^"^ is the contravariant Levi-Civita symbol, defined by the properties that it is totally antisymmetric and 
satisfies e^^^ = 1. From this we find the following transformation behaviour of the constitutive matrix. 



(61) 



Thus, a coordinate transformation of the form (j46l) and (1471) maps admissible coordinates into admissible coordinates 
if and only if the inverse matrix on the right-hand side of (j6ip exists. This is true for almost all values of (ui, U2, 1*3), 
namely whenever dx'^ + UpCLx^ is non-characteristic. 

From (1611) and the transformation behaviour of the momentum coordinates, 



P^L ^PtJ.-Uf,po , Po=Pq , (62) 
we find the transformation behaviour of the characteristic matrix, 

Pol - ppl" = {pol - PpL") (1 - u^i^y' . (63) 

This demonstrates that the characteristic equations det(pol — PpL'') = and det(pol — PpL'') = are equivalent, 
i.e., that the light cones are invariant with respect to coordinate transformations of the form (|46p and (j47l) . To verify 
the transformation behaviour of the Tamm-Rubilar tensor density, we assume that H and M are invertible. Then 
([CT|) takes the form 



M-i = M-i - ( ^ ) . (64) 



-A'' 
A'' 

After calculating the inverse matrices M"-'^ and with the help of ([T5)) . the lower right-hand block of ([M)) yields 

i'-jf^^^X = i'-lii'^^X, (65) 

which, by ([TO]) , implies 

det(M) det(K) = det(M) det(K) . (66) 
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On the other hand, comparison of (|5T|) and ([55)) yields 

det(pol - PptP) det(M) = det(pol - PpL'') det(M) 
and thus, after dividing by p§ — pQ and using (155)) . 



(67) 



Y[ {Po - <^a{p)) det(K) = II (Po - (^aIp)) det(K) 



(68) 



A=l 



This shows that, also with respect to transformations (|46)) and (|47| . the of (|37| transform as a tensor density 

of weight 1, 



^abcd - 



Pa Pb Pc Pd 



det 



\dx) 



^abcd ~ ~ ~ ~ 



Pa Pb Pc Pd 



(69) 



where in this case the determinant on the right-hand side is equal to 1. 



VIII. CALCULATING THE ROOTS OF THE CHARACTERISTIC EQUATION 

In this section we will discuss the question of how to calculate the roots of the characteristic equation if the constitutive 
matrix is given. 

u}i{p), i02{p), u!3{p) and uJiip) are, together with 0^5 (p) = uje{p) = 0, the six eigenvalues of the matrix PpL^. In 
general they are complex. Now the sum of all eigenvalues is the trace, the sum of the squares of the eigenvalues is the 
trace of the square, and so on. (This is obvious in the case of a diagonizable matrix. It is also true in general, as can 
be seen from the Jordan decomposition theorem.) Thus, if we define 

LP^-P^ =trace(L(''i •••L"')) , (70) 
where round brackets around indices mean symmetrization, we get 

PpLP = uji{p) + i02(p) + ojsip) + i^i{p) , 
PppM'' = u:iW + Mp)^ + + Mp)^ , 

PpPaPrLP^^ = C^l(p)' + C^2(P)' + C^3(P)' + '^M'' , 
PpP.PrPxLP''^^ = C^l(p)* + UJ2{P)^ + UJsip)^ + UjM^ . 

If the constitutive matrix is known, L'' can be calculated from (flT)) and the LP^"'P^ can be calculated from ()70p . 
Then, the four equations ()7ip of (maximal) order four determine the four roots uja{p)- In general, solving fourth- 
order equations leads to rather awkward expressions. For several special cases, however, this method allows to calculate 
the roots of the characteristic equation in a convenient way, as will be demonstrated below. 

The transformation behaviour ()5ip of the uja{p) implies the following transformation behaviour of the LP^'''^^ under 
generalised Galilean transformations (H^ and (H5)) . 



cppLP^Pp{LP + 4vP) , 



2 PpPaLP" = ppp„ [LP'' + 2LPv'' + AvPy" 



(72) 

PpPaPrLP"^ = PpPaPr {LP"'' + iLP" + iLPv^v^ + AyPy^v') , 
c^PpP^PrPxLP"^^ = PpPaPrPxiLP"''^ + ALP"'' + QLP" v'' + ALPv"v''v^ + AvPv''v''v^) . 

As an alternative, one can derive (|7^ by multiplying each side of (|551) sufficiently often with itself and then calculating 
the trace. However, this is much more tedious than using ()5ip . 

From the first equation of (|72)) we read that, by a generalised Galilean transformation with yP = —jLP, it is always 
possible to transform LP to LP = 0. Also, we read from (|72)) that with respect to purely spatial transformations 
{yP = 0, c — 1) the LP^"'P' behave as contravariant tensor components. 

The characteristic equation is uniquely determined by the coefficients LP , LP" , LP""' , LP""'''^ . As they are totally 
symmetric, these are (3 -|- 6 -|- 10 -|- 15) = 34 independent real numbers. On the other hand, the constitutive matrix 
has (6x6) = 36 independent components. From this observation it follows that different constitutive matrices must 
yield the same characteristic equation. It is an interesting and important problem to find a necessary and sufficient 
condition for two constitutive matrices to give the same characteristic equation. This problem is unsolved so far; 
however, in the next section we will find a partial answer by determining a group action on the set of all constitutive 
matrices that leaves the characteristic equation invariant. 
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IX. SL(2,R) ACTION ON CONSTITUTIVE MATRICES 

We consider the group SL{2,M) in terms of its natural representation by 6 x 6 matrices, 

'al bl 



SL{2,R) = { \ad-bc=l}. (73) 

This group acts on the set of aU real 6x6 matrices by conjugation, i.e., each element Q e SL{2,M) maps each 
constitutive matrix M onto M' = Q^MQ. In terms of 3 x 3 blocks, the group action reads 









JO 











' a^K + ac{x + 7) 
abK + bcx + o-drf + cdu 




(74) 



It is obvious that this group action leaves the determinant invariant, det(M') = det(M), that it maps symmetric 
matrices onto symmetric matrices and that it preserves the difference of the off-diagonal blocks, x' ~ l' = X~ 1- 
The following calculation shows that the group action leaves the characteristic equation invariant. 



det(,ol-(^;^. -^^^)q-MQ; 

det(pol - Q (pip '%^') Q'^m) = (75) 



p 



M 



In the first step we have used the Sylvester identity according to which det(sl — AB) = det(sl — BA) for all scalars 
s and all n X n matrices A and B. In the second step we have used that 



cl al) [ PpAP ) \bl alj^y ppAP 



(76) 



if ad — 6c = 1, as can be quickly verified by multiplying out the the left-hand side. 

As S'L(2,R) is 3-dimensional, the orbits of the group action must be of dimension < 3. To calculate the dimension 
of the orbits we have to differentiate the group action. A quick calculation shows that the tangent space to the orbit 
through the matrix M with 3x3 blocks according to ([T5|) is spanned by the three matrices 

--(s:). --(^:^o)- <") 

El, E2 and E3 are linearly independent unless one of the three matrices k, v and 7 -I- x is zero and the other two are 
linearly dependent. This demonstrates that the group action foliates a dense and open subset of the set of all real 
6x6 matrices into three-dimensional orbits. 

The S'L(2,R) transformations on constitutive matrices contain two interesting special examples. The first is the 
one-parameter family of transformations with a = & = and b — c^^ which corresponds to the so-called reciprocity 
transformations. By definition, a reciprocity transformation is a transformation Fab ^ —■^Hab, Hab ^ C^ab of field 
strength and excitation, where is a nowhere vanishing pseudoscalar field. A reciprocity transformation changes 

E^^n, B^-^D, H^-CE, D^CB, (78) 

and, thus, the constitutive matrix according to 

; J) - ?i) ■ <™' 

This is precisely the transformation produced by the SL{2,M.) element with a = b = and b = = (. So our 
result contains as a special case the fact that the characteristic equation is invariant under reciprocity transformations 
(cf. Hehl and Obukhov Q, pp 273). Note, however, that in the case of a nonlinear constitutive law a reciprocity 
transformation changes the argument of the constitutive matrix. 
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The second interesting special case is the one-parameter family of transformations with a = d ^ 1 and c — 0. This 
corresponds to adding an axion field, i.e., to a transformation Fab '-^ Fab, Hab '-^ Hab + (pFab with a pseudoscalar 
field <j). This transformation changes 

E, B, D + (j)B, n ^ n - (jjE , (80) 

and, thus, the constitutive matrix according to 

\1 1^ J \j + (f>K + 0(7 + x) + 

This is precisely the transformation produced by the SL{2,M.) element with a = d=l, c^O and h = (p. We have 
thus reproduced the known fact (see Hehl and Obukhov 6], p. 265) that adding an axion field does not affect the 
characteristic equation. 




X. REDUCTION TO 3 DIMENSIONS 



On the left-hand side of the characteristic equation pop we have the determinant of a 6 x 6 matrix. If the impermittivity 
matrix k, or the impermeability matrix f is invertible, this may be reduced to the determinant of a 3 x 3 matrix. We 
give the derivation for the case that n is invertible. We may use the decomposition ([T5|) of the constitutive matrix. If 
we feed this into the characteristic equation ((30)) . and apply Sylvester's formula that det(sl — AB) — det(sl — BA) 
for all scalars s and all n x n matrices A and B, we find 



= det Pol - L . : .... I J L .. (82) 





^ V -ppAPK Pol ) ' ^ ' 

We now use the well-known rule (see e.g. that for any n x n matrices A, B, C, D 

det ^ ) = det(AD-BC) if CD = DC . (84) 



C D 

This puts the characteristic equation into the form 

= det(p2 + popp{AP-fK-^ - K-^xA") +PpPaAP{u - 7a^"'x)A" ) . (85) 
If u is invertible, an analogous calculation results in 

= det(p2 _ poPpiAPxi^-' - u-'jAP) + PpP.AP{k - xi^'^l)^." ) ■ (86) 



Note that, by a reciprocity transformation ([79l) . equation (|85|) transforms into (l86|) and vice versa. Thus, if both v 
and K are invertible, ([85]) and (|86l) are indeed equivalent forms of the characteristic equation. 

If the constitutive matrix (fT5|) is invertible, (|85|) and (|86|) are equivalent to the form derived by Graglia, Uslenghi 
and Zich [20], cq. (7) and (8), apart from the fact that they considered only linear constitutive laws. If the cross-terms 
X and 7 vanish, they reduce to the form of Damaskos, Maffett and Uslenghi [2l[, eq. (7). An alternative version of 
the characteristic equation, for linear constitutive laws without cross-terms, was derived and discussed by Itin [23 ] . 



XI. INVARIANCE UNDER TIME AND SPACE INVERSION 



In general, the characteristic equation is not invariant under time inversion {pa,Pi,P2,P?,) ^ i~PQ,Pi,P2,P3)- Sim- 
ilarly, it is not invariant under space inversion {po,Pi,P2,P3) '-^ (POi —Pi, —P2, —Pa)- However, owing to the homo- 
geneity of the characteristic polynomial, it is invariant under combined time and space inversion, (pojP1j?'2,P3) '-^ 
i~Po, ~Pi, ~P2, —Pa)- This implies that the characteristic equation is invariant under time inversion if and only if it is 
invariant under space inversion. It is easy to see from (j30|) . and even more obvious from (|85p or (j86p . that a sufficient 
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condition for invariance under time inversion is that the magneto-electric cross-terms x ^^id 7 vanish. However, this 
is not necessary. 

Clearly, the characteristic equation is invariant under time inversion if and only if its roots coincide pairwise up to 
sign, UJ4 = —oji and ut^ = —oj2- From ([7T|) we read that this is true if and only if L'' = and L^""^ = 0. In this special 
case (|7ip reduces to two second order equations for Lof and w| which can be solved easily. The characteristic equation 
reads 

= pI{pI-u;,{p)^){pI-u;,{p)^) (87) 

with 



Wl/2(P)' = \ppPaLP^ ±J^PpP„PrPx{LP--^ - ^LP^L-^^ 



Thus, the necessary and sufficient condition for hyperbolicity in the time-symmetric case is that the right-hand side 
of ((88l) is real and non-negative, i.e. PpPaL^"' > and 

^{ppP.LP-f > PpP.PrPxLP^^^ > \{ppPaLP^f (89) 
for all p in IS? . If, in addition, we want to prohibit zero- frequency modes, we have to strengthen the condition 

PpPaLP'' > to 

PpP^LP" > Q foraU p^^. (90) 



It would be desirable to rewrite (|89p and (j90| as conditions on the constitutive matrix (|T5|) . However, it is hard to 
see how this can be done in a practicable way. 

Equation ((87)) . with (l88l) and (|90p . is the general form of the characteristic equation for the case that we have 
two real double-cones which are mirror-symmetric with respect to time inversion and that zero-frequency modes are 
prohibited. Each of the two double-cones is the null cone of a Finsler metric 

2 OPaOpb 

More explicitly, the time-time, time-space and space-space components of the two Finsler metrics gi and 52 read 



1 JlppPaPrPxiLP'^^^-kLP'^L-^) 
9l% = - 1 > 5?;^ - , 5^;^ = l^P^ ± ^ 1.^^^^ ■ (92) 

The conditions ([5^1 and (I^Hl) guarantee that each of these two Finsler metrics has Lorentzian signature at all p 7^ 0. 
For such Finsler metrics a Fermat principle was proven in [23| . Note, however, that in this article the Finsler light 
cones where assumed to be smooth everywhere (except, of course, at the vertex). This is not the case with the metrics 
([5^ . At points where the square- root in has an isolated zero the two light cones form conical singularities. The 
resulting phenomenon of "conical refraction" was already mentioned in Section [VII 



XII. CONDITION OF NON-BIREFRINGENCE 

If there are two different real roots uji{p) >Q and uj2{p) > 0, we have birefringence in the forward direction; similarly, 
if there are two different real roots oJ^ip) < and W4(p) < 0, we have birefringence in the past direction. In this 
section we want to investigate the condition for non-birefringence. To that end we consider the case that the four 
roots wi, LU2, 0J3, and UJ4 pairwise coincide, U!i{p) = uj2{p} and oJsip) = ^4(7?) for all p S M'^. For the time being we 
do not require that the roots are real. With the help of our assumption that the roots pairwise coincide, it is easy to 
solve the first two equations of ((7T|) . 



^iMp) = IppL" ±^\ppPa {LP- - \lpL-) . (93) 
By a generalised Galilean transformation we can always transform LP to zero, recall ([7^ . so that ([55]) simplifies to 



wi/sip) = ±\l\ppPaLP- . (94) 
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Note that time-symmetry is then automatically satisfied. As a consequence, the reduced characteristic equation reads 

{po - {po - LoM? = {pI ~ \LP'^PpP<r? = . (95) 

Thus, the characteristic variety is the null-cone of a quadratic form, i.e., our assumption that the roots pairwise 
coincide excludes proper Finsler structures. The coefficients L'"^ that determine the quadratic form depend on x and, 
in the case of a non-linear constitutive law, also on D{x) and B{x). This result is true independent of whether or not 
we require hyperbolicity, i.e., independent of whether or not the roots (|94p are real. 

By ([M]) , hyperbolicity is satisfied if and only L'"^ is positive semidefinite. This gives a Lorentzian or a degenerate 
quadratic form. The degenerate case is excluded if we require that uja{p) 7^ for A = 1,2,3,4 and p 7^ 0. Thus, 
the condition of non-birefringence necessarily leads to a Lorentzian null cone if we require hyperbolicity and exclude 
zero-frequency modes. 

These findings corroborate earlier results found by Hehl and Lammerzahl [23| and by Itin [2^ for linear constitutive 
laws. They give a satisfactory answer to the question of what kind of light cones are possible in the case of non- 
birefringence. However, it would also be desirable to have a condition on the constitutive matrix that is necessary 
and sufficient for non-birefringence. Such a condition is still to be found. 

XIII. THE SYMMETRIC HYPERBOLIC CASE 

Recall that the evolution equations are symmetric hyperbolic if and only if there is a matrix S such that 

S^VpL^S = [S-^ppLP^f (96) 
for all p in R^, where (•)'^ denotes transposition. With L'' from p7|) . (|96p takes the form 

We want to give a characterization of the symmetric hyperbolic case in terms of the constitutive matrix M. To that 
end we use the following result which is based on a simple Schur lemma type argument. 

Lemma 5. If U and V are real 3x3 matrices such that 

PpAPU = VppAP (98) 
for ah p € M^, then U = V = cl with some c e R. 

Proof. For any py^ in R^, the matrix PpAP has a one-dimensional kernel spanned by p. Thus, by applying (1^5]) to 
multiples oi p we see that U maps every one-dimensional subspace into itself, hence U = cl. Then (j98|) takes the form 
(V — cl)ppAP = 0. If p runs over R'^, the image oi PpAP runs over R^. Thus, our last equation requires V = cl □ 

With the help of this lemma, we prove the following proposition. 

Proposition 5. The evolution equations are symmetric hyperbolic if and only if the constitutive matrix M is (i) the 
zero matrix, (ii) positive definite, or (iii) negative definite. 

Proof. By applying Lemma [5] to each of the four 3x3 blocks of (j97|) we find 

with real numbers a,b,c,d. As the left-hand side of is the transpose of the left-hand side of (jlOOp . this can be 
true only if a = d = and b — c, i.e. the necessary and sufficient condition for symmetric hyperbolicity is that there 
exists an invertible matrix S such that M = c(SS"^)~^. This is true (i) with c = if and only if M = 0, (ii) with 
c > if and only if M is positive definite, and (iii) with c < if and only if M is negative definite. □ 
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Clearly, the case M yields the characteristic equation Pq = and is physically uninteresting. Thus, M must be 
positive or negative definite to give symmetric hyperbolic evolution equations. In the case of a linear constitutive law 
this condition is equivalent to the assumption that the energy density w = ^{EpD^ + HpB^) is positive or negative 
definite. In particular, the positive (or negative) definiteness of M requires that k, and u are positive (or negative) 
definite, i.e., that we have positive (or negative) definite permittivity £ = and permeability /x = 

Proposition [5] generalises a result that was derived in 26], Section 2.1. There only linear constitutive laws were 
considered and a Lorentzian metric was presupposed. It was shown that, if magneto-electric cross-terms are absent 
and permeability and permittivity are positive definite, the evolution equations are symmetric hyperbolic. The above 
result shows that the definiteness condition is not only sufficient but also necessary and that this result (i) carries 
over to non-linear local constitutive laws, (ii) can be formulated without reference to a background metric, and (iii) 
remains true if magneto-electric cross-terms are allowed. 



XIV. EXAMPLES 
A. Biisotropic media 

A medium is called biisotropic (at a point x) if there is coordinate system such that each 3x3 block of the constitutive 
matrix is a scalar multiple of the unit matrix (at x), i.e. 



with scalars ft, i/, x^ 7- If this is true with X = 7 = Oi the medium is called isotropic. We want to find a neccessary 
and sufficient condition for a biisotropic medium to yield hyperbolic evolution equations. 
With (|10ip . the characteristic equation becomes 

det(p2i _ poppA''(x-7) + PpPaAPA^{>iiy-x"/)) = (102) 
as can be read from (|85|) or ((86|) . The determinant can easily be calculated, resulting in 



[{pI - \p\^n,.-xi)y +Pl\p\'{x-ir) =0. (103) 



Hyperbolicity requires that all roots of this equation are real, which is obviously the case if and only ii x — 1 
and Kv — XI > . In the last inequality we replace the > sign by a > sign, as the = sign only gives zero-frequency 
modes. With this unphysical zero-frequency case omitted, we can thus say that a biisotropic medium yields hyperbolic 
evolution equations if and only if 

X = 7 and det(]V[) > . (104) 
In this case the reduced characteristic equation has two roots of multiplicity 2, 



wi(p) = i02{p) = -uJz{p) = -iOiip) = IpI Vdet(M) . (105) 

There is no birefringence, i.e., we have a unique future light cone and a unique past light cone. This double-cone is 
the null cone of the Lorentzian metric 



= - 1 = g''" = v/det(M) S"" (106) 

and, of course, also of any metric that is conformal to this one. 

Note that (|104|) is equivalent to the requirement that the matrix (jlOip is (positive or negative) definite. From 
Section IXIIII we know that then the evolution equations are symmetric hyperbolic. Hence, for a biisotropic medium 
the condition of hyperbolicity is equivalent to the condition of symmetric hyperbolicity. For an isotropic medium 
(|104p reduces to the condition that n ~ e^^ and v — must have the same sign. 



B. Born-Infeld theory 



Born-Infeld theory was introduced by Born and Infeld in 1934 [27]. The motivation was to modify standard vacuum 
electrodynamics in such a way that the field energy in a small ball around a point charge is finite. This was achieved 
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by assuming a non-linear vacuum constitutive law of the form Hab ~ dL/dFab, where L is the Born-Infeld Lagrangian 

L = —\/b^ + b^FabF'^'' — *FabF'^''. Here one assumes that, as in standard vacuum electrodynamics, a spacetime 
metric of Lorentzian signature is given: the star is the Hodge operator defined by the spacetime metric and latin 
indices are raised and lowered with the spacetime metric. 6 is a constant of nature, called the "absolute field" by 
Born and Infeld. 

As we allowed for non-linear constitutive laws throughout, Born-Infeld theory fits perfectly well into the general 
scheme considered in this paper. In the following we will apply the results of the preceding sections to Born-Infeld 
theory, thereby deriving the structure of the characteristic variety (i.e., of the light cones) in the Born-Infeld theory 
and establishing the result that the Born-Infeld theory admits a well-posed initial- value problem. None of these results 
is new. (The light cones of the Born-Infeld theory were determined, e.g by Boillat [2^; a proof that the Born-Infeld 
initial- value problem is well-posed can be found e.g. in Serre (29|.) However, the derivations given here are quite 
different from the ones available in the literature and illustrate the general results given above. 

Our first goal is to demonstrate that, for initial values given on a hypersurface that is spacelike with respect to the 
spacetime metric, the Born-Infeld initial- value problem is well-posed. To that end we choose coordinates (x°, x^,x'^,x^) 
such that the chosen spacelike hypersurface is given by the equation .t" = constant. In addition, we may assume that, 
at some particular point on the hypersurface, the coordinates are pseudo-orthonormal with respect to the spacetime 
metric. This leaves he freedom of orthogonal transformations of the spatial coordinates on the tangent space of the 
chosen point. Then, at the chosen point, the Lagrangian takes the form L = - y/lATT^(WB~WE^J^~(WB^ . 
Here and in the following, greek indices are raised and lowered with the Kronecker delta. At the chosen point, the 
constitutive law reads 

D^^^= , ^^i^^ + B-i^.i^^ (107) 

^ TP /Ta I 7 9 / 7->,, 7-> 7-7?,, -rr? \ / m,, ?-> ^2 ' ^ ^ 



dB^ ^fe4 + b^B^^B^ - E^^Ef,) - (E^^B^f ' 

Clearly, in the limit 6 — ?> oo the non-linear equations (|107l) and (|108p tend to the linear standard vacuum constitutive 
law = E^ and Ti,^ = E'^. (When comparing our equations (jl07l) and (|108|) with the corresponding equations on 
page 437 in the original Born-Infeld paper [2^, note that there is a sign error in the latter.) 

Equations (|107|) and (|108p can be solved for E^^ and "H^, which demostrates that our coordinates are admissible 
in the sense of Definition [TJ The resulting equations, which are found after an elementary though rather tedious 
calculation, read 

= Ih- = TF ((^' + B'^B^jD'^ - B-D^Bf^) (109) 

= 1^ = 4 ((6^ + D''D,)B^ - B'^D.D^') (110) 
oB^ W ^ ' 



where 



W{D, B) = J{b^ + BPBp){b^ + D-D,) - (B^Dr)^ (111) 



is the Legendre transform of L{E, B) with respect to the pair of variables E, D, i.e., W = Efj^D'^ — L. Now the 3x3 
blocks of the constitutive matrix take the form 

= ^^3"^"^ (wHc^p - {b" + BPBp)D^D,i - [b" + D^Dr)B^Bp + B^Dx{B^Dp + BpD^) 

d^W _ {b^ + D-Dr) 
""^ - dB'-dBf^ ~ (62 + BPBp) ^''^ ' ^ ' 

d^W _ B^Dr BaDfi-BfiDa 

7a/3 - X0a - Q^aQuP ^ + BP B pf''^ ^ W ' 

K is symmetric, so it has three real eigenvalues with orthogonal eigenvectors. As our coordinate system is fixed only 
up to orthogonal transformations of the spatial coordinates, we may choose the coordinates such that k, is diagonal. 
The eigenvalues ki, K2 and K3 of k are 
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As they are strictly positive, ^^ is positive definite for all (Z5, B) in M^. 

Now we make a coordinate transformation (a;°, x^, x^, x'^) i-> i^, a;^, x'^) that induces at the chosen point a 
generalised Galilean transformation pi)) and (HSI) with 

a = b^i = diag(V^, V^, V^) , c = det(a) = , = ■^e'^'^'^B^L'^ . (114) 

By (gg]), this transforms (fTT2)) into 

By (US]), the constitutive matrix takes the form 

As this matrix is obviously positive definite, Proposition [5] proves that, in the twiddled coordinates, the evolution 
equations are symmetric hyperbolic, so the initial-value problem is, indeed, well-posed. 

Note that the coordinate transformation was necessary for achieving our goal. The Galilean boost with v"' had the 
effect of killing the antisymmetric part of 7 = ■ In contrast to M, the original constitutive matrix M was not 
positive definite for all values of {D, B) but only for (i5, B) in a certain neighborhood of the origin in R^. 

Finally, we want to calculate the Born-Infeld light cones. In the twiddled coordinates, the characteristic equation 
(|85l) takes the form 

Thus, there is no birefringence (cf. Boillat (28j): the Born-Infeld theory determines a unique past and a unique future 
light cone, given by the equation 

G PaPb - -P, + (^2+ 5,5^)2 - • (118) 

In the original coordinates, in which the light cone of the spacetime metric takes the form p^ = Pi + P2 + vh (|118p 
reads 

PaPb h /,2 I uuu \2 \ ^ ^ ^ ^ • (11^) 

With the vP from (|114p and the from (|113p the equation for the Born-Infeld light cones becomes 



{Wpo - eP^-p,BM' ,2. 2^ 2^ 2^ D-D^rB^B^, . 2. [BPBp-P-P^Y , .^.o ^w 2 2. 

52(^2 ^ B>^Bx) " iPl+P2+P3) ^ {Pl+P2)-\j 1 + {D-Bry (Pi - P2) 

(120) 



Conclusions 



In this article we have considered Maxwell's equations with a local constitutive law and we have found some useful 
results. In particular, we have derived several versions of the characteristic equation and we have worked out a 
method of how to calculate its roots; moreover, we have conveniently characterised the class of all constitutive laws 
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that give symmetric hyperbolic evolution equations. However, symmetric hyperbolicity is not necessary for well- 
posedness of the initial-value problem. If we want to characterise the class of all constitutive laws for which the 
initial-value problem is well-posed, we need a criterion for strong hyperbolicity. This is an open problem. It would 
also be desirable to characterise all constitutive laws that give hyperbolic evolution equations. Again, this is an open 
problem. We were able to characterise the light cones in the case of invariance under temporal or spatial inversions 
and in the case of birefringence; however, we could not find a condition on the constitutive matrix that is necessary 
and sufficient for either of these two properties. We have found a certain group of transformations that act on the set 
of all constitutive matrices and leave the characteristic equation invariant; however, we could not determine the set 
of all such transformations. So there are a lot of open problems that should be addressed in future work. 

Note added in proof: After this paper was submitted the author learned about SchuUer, Witte and Wohlfarth [soj , 
Ratzel, Rivera and Schuller [sij , and Favaro and Bergamin [32| where important related results were found. 
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